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MATTHEW FICKUS AND DUSTIN G. MIXON 

Abstract. Given a channel with additive noise and adversarial erasures, the task is to 
design a frame that allows for stable signal reconstruction from transmitted frame coeffi- 
cients. To meet these specifications, we introduce numerically erasure-robust frames. We 
first consider a variety of constructions, including random frames, equiangular tight frames 
and group frames. Later, we show that arbitrarily large erasure rates necessarily induce nu- 
merical instability in signal reconstruction. We conclude with a few observations, including 
some implications for maximal equiangular tight frames and sparse frames. 



1. Introduction 

Modern communication networks are rooted in both information theory and algebraic 
coding theory. In these contexts, after deciding on a probabilistic noise model for a given 
communication channel, one chooses an appropriate error-correcting code to achieve reliable 
communication with a maximal information rate. For linear codes in particular, encod- 
ing and decoding reduce to problems in linear algebra over finite fields. Beginning with 
Goyal et al. [16], finite frame theorists have studied the generalizations of these problems 
to real and complex vector spaces. This generalization allows one to use certain mathemat- 
ical tools, such as matrix norms and condition numbers, which are not well-defined in the 
finite-field setting. 

This paper is concerned with a channel characterized by additive noise and adversarial 
erasures. We encode a signal x G C A/ using inner products (x, f n ) with members of a spanning 
sequence of vectors {/ n }^Li ^ C M ; such a sequence is called a frame. In transmitting these 
inner products, we expect additive noise due to various phenomena such as atmospheric 
interactions or round-off error. If these were the only sources of noise, then it would be 
reasonable to reconstruct the original signal by applying the Moore-Penrose pseudoinverse. 
To be precise, letting F denote the M x N matrix whose columns are the / n 's, we transmit 
y = F*x. At the receiver, an estimate of x is then found by computing 

x = ((FF*y 1 F)(y + e )=x + (FF*) _1 Fe, 

where e is additive noise. Assuming the channel has a "signal-to-noise ratio" of R = ||y||/||e||, 
we can estimate how the size of the estimate error (FF*)~ 1 Fe compares with the size of the 
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original signal x. Indeed, \\(FF*) 1 Fe\\ < ^\\x\\, where 

C:= 8UP R . WiEETM = sup E^, li(^;);^iL W | 2 || (Fn - lF || 2 . 

xec M \{o} \\ x \\ xec M \{o} ll x ll ll e ll 

eec^Uo} eec^uo} 

Here, C is the condition number of F, denoted Cond(F), which is equal to the ratio of the 
greatest singular value of F to its smallest one. From this perspective, the best possible 
frames are those with Cond(F) = 1, a fact which occurs precisely when FF* = AIm for 
some A > 0; such F's are called tight frames. 

We consider channels that, in addition to additive noise, suffer from erasures. To be 
precise, the transmitted signal is a sequence of inner products: F*x = {(x, f n )}n=i- Like [16j, 
we consider channels which completely delete some of these inner products and add noise to 
the remaining ones. However, whereas [IB] focuses on average reconstruction performance, 
we instead follow [TT] and [TTj by focusing on worst-case reconstruction performance. In 
particular, by considering worst-case performance, we design frames which are robust against 
the erasure of any fixed number of inner products. Such frames could be particularly useful 
in situations where an adversary is actively deleting our most useful frame coefficients, i.e., 
active jamming. We say that such frames are robust against adversarial erasures. 

To design such frames, we first acknowledge that we cannot reconstruct the M-dimensional 
signal x without at least M inner products. As such, we must impose some constraint on the 
adversary. For the highly constrained adversary, Casazza and Kovacevic pTTJ show that tight 
frames of unit-norm vectors, called unit norm tight frames, are optimally robust against one 
erasure. Soon thereafter, Holmes and Paulsen [TTj showed that equiangular tight frames— 
explicitly defined in the next section — are optimal for two erasures. To combat the highly 
destructive adversary, Puschel and Kovacevic [20] propose frames which are maximally robust 
to erasures in the sense that the original signal can be recovered from any M of the N inner 
products. Other constructions of such maximally robust frames are given in [TJ, where they 
are dubbed full spark frames. It remains unclear whether the deletion of any N — M frame 
coefficients will allow for numerically stable reconstruction; this is an important distinction 
between invertible submatrices — the subject of [TJ ED] — and well-conditioned submatrices, 
which is our focus here. 

To be clear, in this paper we consider the case where the adversary is only capable of 
removing a proportion p of the N transmitted inner products. Then the remaining (1 —p)N 
inner products correspond to a subcollection of (1 — p)N columns of F, which we require 
to be well-conditioned for our reconstruction to properly combat the additive noise. Since 
erasures occur according to the will of an adversary, as opposed to a random process, we 
must ensure that every subcollection of (1 — p)N columns of F is well-conditioned. This 
leads to the following definition: 

Definition 1. Given p G [0,1] and C > 1, an M x N frame F is a (p,C) -numerically 
erasure-robust frame (NERF) if for every JC C {1,...,N} of size K := (1 —p)N, the 
corresponding M x K submatrix F^ has condition number Cond(i 7 )c) < C . 

The purpose of this paper is to make the first strides in studying NERFs. In the follow- 
ing section, we use a variety of techniques to form different NERF constructions. Taking 
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inspiration from matrix design problems in compressed sensing, we first investigate frames 
whose entries are independent Gaussian random variables. Next, we consider equiangular 
tight frames, with which we get stronger results at the price of higher redundancy in the 
frame. Later, we show how the symmetry of group frames makes them naturally amenable 
to NERF analysis. In Section 3, we report a result on the fundamental limits of NERFs: 
that NERFs cannot stably support erasure rates p which are arbitrarily close to 1. Finally, 
we conclude with a few interesting observations in Section 4. 



2.1. Random frames. The reader may have noticed some similarity between the definition 
of numerically erasure- robust frames and a matrix property which comes from the compressed 
sensing literature: the restricted isometry property (RIP). To be clear, anMxiV matrix F 
is RIP if it acts as a near-isometry on sufficiently sparse vectors, that is, \\Fx\\ ~ ||a;|| for all 
vectors x with sufficiently few nonzero entries |12j . In other words, submatrices F/c composed 
of sufficiently few columns from F have F^F K particularly close to the identity matrix, 
meaning F^F K is particularly well-conditioned. The key difference between NERFs and 
RIP matrices is that well-conditioned NERF submatrices F^ have K :— |/C| > M columns, 
whereas F^ has fewer than M columns in the RIP case. Regardless, in constructing NERFs, 
we can exploit some intuition from the construction of RIP matrices. In particular, the RIP 
matrices which support the largest sparsity levels to date arise from random processes. As 
an example, one may draw the entries independently from a Gaussian distribution of mean 
zero and variance j?; this was originally established in Lemma 3.1 of [13]. What follows is 
the analogous NERF result: 

Theorem 2. Fix e > and pick an M x N frame F by drawing each entry independently 
from a standard normal distribution. Then F is a (p,C) -numerically erasure-robust frame 
with overwhelming probability provided 



This occurs whenever p < 0.1460. That is, the random construction in Theorem [2] is nu- 
merically robust to erasure rates of up to almost 15%. However, approaching a 15% erasure 
rate while satisfying flTJ will admittedly cost a large worst-case condition number C along 
with high redundnacy in the frame. Still, Theorem [2] provides a useful guarantee. For 
example, a Gaussian matrix of redundancy |j = 5 will, with overwhelming probability, be 
robust to 1% erasures with a worst-case condition number of 10. We proceed with the proof: 

Proof of Theorem^ Pick JC C {1, . . . , N} of size K = (1 — p)N. Note the assumption flTJ) 
implies that ^ < 1 — p and so K = (1 — p)N > M. As such, Theorem 11.13 of [H] gives 
bounds on the singular values of the random "tall" K x M matrix Fj^: 



Pr[\^ - y/M-t < a mi n(F£) < a max (F^) < VM + t] > 1 - 2e"* 2/2 Vt > 0. 



2. Constructions 




(1) 



Note that ([!]) requires its right-hand side to be positive, which in turn implies 
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This probabilistic bound on the extreme singular values implies 

r K + y/M + t 



Pr 



Cond(F /c ) < 



K 



M-t 



> 1 - 2e~' I 2 



Vt > 0. 



Taking a union bound over all ( K ) = ( N _ K ) < 



Pr 



3/C s.t. Cond(F/c) > 



K + VM + t 



M-t 



< 
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N-K 



< 2 exp 
= 2 exp 



£ 2 



AT) log 



eN 



t 2 e 
— + Np log - 
2 p 



N-K 
Vt > 



Now pick £ such that C 



namely £ 



IK— J M-t 1 

implies t > 0, and so we may substitute it into 



p 



and simplify the result: 




Pr[3/C s.t. Cond(F,c) > C] < 2 exp 

< 2e-^/ 2 . 

Thus, the probability of F not being a (p, C)-NERF is 0(A^" 
F is a (p, C)-NERF with overwhelming probability. 



(2) 

Note that (ED) 
logp) 



for every fixed a, meaning 

□ 



2.2. Equiangular tight frames. The previous subsection constructed a random family of 
numerically erasure-robust frames by following intuition from known constructions of ma- 
trices with the restricted isometry property. Indeed, state-of-the-art RIP matrices are built 
according to random processes, while deterministic constructions have found less success [5]. 
In this subsection, the analogy between RIP matrices and NERFs will break down, as we 
will construct deterministic NERFs which outperform the random counterparts with much 
larger erasure rates, albeit at the price of high redundancy. 

In [T7], Holmes and Paulsen show that frames of pairwise dissimilar unit-norm vectors 
are robust to two erasures. This dissimilarity is measured in terms of worst-case coherence, 
which is known to satisfy the Welch bound: 



Theorem 3 (Welch bound 

worst-case coherence 



Every M x N frame {f n }n=i °f unit-norm vectors has 



max \(fn,fn')\ > 
n,n'e{l,...,N} 



N-M 



M(N 



Specifically, Proposition 2.2 of [17J gives that minimizers of worst-case coherence are op- 
timally robust to two erasures. For many values of M and N, there exist frames which 
achieve equality in the Welch bound. In fact, a sequence of unit-norm vectors F = {f n }n=i 
achieves the Welch bound if and only if it is an equiangular tight frame (ETF), meaning 
that it is a tight frame (i.e., FF* = AIm) which also satisfies the equiangularity condition 
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that \(f n , f n ')\ is constant over all choices of n ^ n' [22]. Not only are ETFs minimizers of 
worst-case coherence, they also have combinatorial symmetries related to strongly regular 
graphs, difference sets and Steiner systems; these combinatorial structures have each been 
used to build the only general ETF constructions to date [H2, EU EH] . 

In this subsection, we consider an ETF construction based on a particular difference set. 
Let q be a prime power, take M = q + 1 and N = q 2 + q + 1, and consider the trace map 
Tr : ¥ q 3 — > ¥ q defined by Tr(/3) = (3+(3 q +(3 q . Given a generator a of the multiplicative group 
of F ? 3, define the M-element subset M. C r L N by M. = {t : Tr(ct*) = 0}. By construction, 
At has the property that every nonzero member of Zat can be uniquely expressed as the 
difference of two elements of At; this set is called the (N, M, 1) -Singer difference set [18\. 
As shown in [26], any difference set At C Zjv can be used to build an ETF by taking rows 
from the N x N discrete Fourier transform matrix which are indexed by members of Ai and 
then normalizing the resulting columns. This construction has the following guarantee: 

Theorem 4. Take M = q + 1 and N = q 2 + q + 1 for some prime power q, and let F be 
the M x N equiangular tight frame F constructed from the (N, M, 1)-Singer difference set, 
as in [25]. Then F is a (p, C) -numerically erasure-robust frame for every p < | — ^ . 

This result essentially states that such ETFs are numerically robust to erasure rates of 
up to 50%. Compared to the random construction of the previous section, which required 
less than 15% erasures, this is quite an improvement. Certainly, the frame redundancy 
is unbounded in this case since iV scales as M 2 , but the reward is significant. For example, 
such ETFs are robust to 49% erasures with a worst-case condition number of 10. Meanwhile, 
for iV ^> M, Theorem El only guarantees — with overwhelming probability — a worst-case 
condition number of 10 when less than 9% of the frame is erased. 

Proof of Theorem^ Pick some /C C {1, . . . , iV} of size K — (1 — p)N, and let {A^ ;m }m=i 
denote the eigenvalues of F K F^. Taking Sic := max m |^A/c ;m — 1|, we have 

(Cond(^)) 2 = Cond^F*) = A ™ffiffi < ^ (3) 

provided 5/c < 1; if 5/c > 1, then F/c could be rank deficient. Moreover, the fact that F K F^ 
and 1m are simultaneously diagonalizable implies 

M 

S K = %? maX aj1 \^K;m ~ Jf\ 2 [A/C;m ~ Jjf? = ^^Ai F K. F K, ~ J^M?]- (4) 

me{l,...,M} ' * 

m=l 

From here, the cyclic property of the trace and the fact that F has unit-norm columns give 
Tr[(^F* - f I M ) 2 ] = Tr[(i^) 2 ] - f Tr^F*] + §Tr[I M ] 
= Tr[(i^) 2 ] - f Tr[^^] + f 

k&K. k'EK 

Since F is an ETF, the inner products between distinct frame elements achieve equality in the 
Welch bound: |(/ fc ,/ fc ')| 2 = M(N M \) f° r ever y k k'. Applying this to (jSJ) and substituting 
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into (j3J) then gives 

&< M2 (k+k(k in N ~ M R2 \ M(M-1)(N-K) pM(M - 1) 
5/c < 777 # + 1) 77777 77 " 77 = ?7777 77 = 71 77777 77- ( 6 ) 



K 2 \ v J M(N-1) MJ K(N-l) (l-p)(N-l)' 

1 c 2 

2 C 4 +l' 



According to the theorem statement, iV = M 2 — M + 1 and p < \ — -M , 1 , and so 



K ~ 1 - p " (C 2 + l) 2 

Substituting this into (j3J) therefore gives Cond(F/c) < C. □ 

We note that together with the necessary condition < 1 indicate that of all M x N 
ETFs, the above proof technique will only work for those with N = Q(M 2 ) frame elements. 
However, as noted in Proposition 2.3 of [I], M x N ETFs necessarily have iV < M 2 , and 
so the ETFs for which the above proof can demonstrate NERF are asymptotically maximal. 
A long-standing open problem in frame theory concerns the existence of M x iV ETFs with 
iV = M 2 , or maximal ETFs, and it is easy to verify that Theorem H] also holds for this 
conjectured family; to date, these are only known to exist for finitely many M's [3j. As for 
asymptotically maximal ETFs, the difference set construction of Theorem H] is the only such 
infinite family known to the authors. Regardless, a version of Theorem H] holds for every 
family of asymptotically maximal ETFs, which follows directly from ([6]): 

Theorem 5. Every M x N equiangular tight frame with M ^J-i) — a ^ s a C) -numerically 

erasure-robust frame for every p < a ( C a°i^+(c a +i) a • 

Since maximal ETFs are particularly difficult to construct, different fields have turned to 
mutually unbiased bases (MUBs) to fill their need for large frames with low coherence [19, 22J. 
There are several M x M 2 MUB constructions, all of which have the property that the inner 
product between any two columns is of size or 1/a/M (21 [91 [19]. As the Welch bound in 
this case is 1/ \/M + 1, MUBs are "almost" ETFs. It is therefore surprising that the above 
proof techniques fail to show that MUBs are NERFs. To illustrate this fact, we consider the 
MUB version of ©: 

A < j£! ( K + K (K - 1) J- - = M(M — 1) _ M — 1 



K 2 \ v 'MM) K (l-p)M' 

Due to the necessity of 5% < 1, this bound will not be useful unless p < jj. However, even 
in this case, substituting (jTJ) into ([3D gives 

(Cond^)) 2 < < V^Zgg + ^fi. (8) 

Further since < p < A, separately bounding the numerator and denominator gives that 
the right-hand side of OH]) is always at least 2\JM — 1, meaning (jSJ) says very little about the 
worst-case condition number, regardless of the erasure rate. 

It remains to be seen whether this is a true distinction between ETFs and MUBs or is 
instead an artifact of our proof techniques. One way to improve this analysis is to find a better 
bound on the frame potential (jSJ), see [7]. To be clear, we can certainly bound it in general 
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using worst-case coherence, and such a bound is tight whenever the frame is equiangular. 
However, when the frame is not equiangular, this bound is less than optimal. For a better 
bound in the general case, suppose that for every n G {1, . . . ,N}, the distribution of the 
squares of inner products {|(/ n , /n')| 2 }v=i is identical. In this case, let dp G M. N denote the 
common sequence of squared inner products, sorted in nonincreasing order. We can then 
bound the sum in ([5]) by exploiting this structure: 



A' 



EEk^'>i 2 ^E«- ( 9 ) 



keKk'eK k=l 

Combining bounds (TjJ, ([5]) and @ then yields 



«s^(££i<A,/wi 2 -^)<M*(iy> 

v ketc fe'e/c 7 v k=i 



In particular, in order to use (fTOj) to guarantee 5% < 1, we want the average of the K largest 
values of dp[k] to be close to Further note that if F is a unit norm tight frame, which 
necessarily has tight frame constant A = j^, then the average of all values of dp[k] is j^: 

1 N 1 N IN 1 

k=l n=l 



In such cases, using (II 01) to estimate the NERF properties of a given frame reduces to finding 
how quickly (as a function of K) the average of the K largest values of d F [k] converges to 
the average of all of its values. 

With this refined analysis, we can prove that MUBs are actually NERFs. We note that 
the bound (J5J) is identical to the worst-case coherence bound unless K is large, since d F in 
this case has one copy of 1, M(M — 1) copies of -p, and M — 1 copies of [21 HI [19]. Indeed, 
analysis with (Q can only show that MUBs are NERFs when the erasure rate is small: 

Theorem 6. An M x M 2 frame of mutually unbiased bases is a (p,C) -numerically erasure- 
robust frame for every p < fc^W^ATi) ■ 

Note that the above guarantee is not nearly as good as the one we got for ETFs, or 
even for random frames. However, the result is still of some use; for example, when M is 
sufficiently large, removing any 0.96M of the M 2 frame vectors will leave a submatrix of 
condition number smaller than 10. 

Proof of Theorem® Applying (TTUT) to the distribution d F of the M x M 2 MUB yields 
5l < > dp \k] = 1 + M(M - 1) = — i '-. 
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Since K — (1 — p)N and iV = M 2 , we can simplify and apply p < ^+i) 2 (m+i) ^° 

. 2 K pM_ (C 2 - i fM 

K ~ 1 - p ~ {C 2 + 1) 2 (M + 1) - (C 2 - l) 2 

(C 2 - 1) 2 M _ (C 2 - l) 2 

" (C 2 + 1) 2 (M+1)-(C 2 + 1) 2 (C 2 + l) 2 ' 1 ' 

Substituting this into (j3J) therefore gives Cond(Fx;) < C. □ 

2.3. Group frames. In the previous subsection, we demonstrated that mutually unbiased 
bases are NERFs by exploiting an important property: the distribution of the squares of 
inner products {\(f n , fn>)\ 2 }n>=i is identical for every f n . In this subsection, we will consider 
a much larger class of unit norm tight frames that enjoy this identical distribution property: 
group frames. Given a seed vector / G C M and a finite subgroup G of the group of all M x M 
unitary matrices, the corresponding group frame is the orbit {U f}u^G of / under the action 
of this group, though {Uf}u<=G should only be called a frame if the Uf's span. In fact, if 
ll/H = 1, then {Uf}uec wm be a unit norm tight frame provided the group G is irreducible, 
meaning that for any nonzero x G C M the vectors {Ux}ueG necessarily span C M ; for this 
and other interesting facts about group frames, see [23] . Note that for any U, U' G G, 

(Uf,U'f) = (f,U*U>f) = (f,U- 1 U>f). 

Since each U~ l acts as a permutation on G, we conclude that {(Uf, U'f)}u'eG is a permuta- 
tion of {(/, U' f)}u'eG, thereby confirming our above claim that each row of the Gram matrix 
F*F is identically distributed. 

To illustrate the usefulness of group frame ideas in estimating 5k with (jTOl) . we will apply 
it to group frames generated by the symmetric group of the simplex. First, we define a 
(regular) simplex to be any M x (M + 1) matrix \1/ whose (M + 1) x (M + 1) Gram matrix 
is \f r * 1 I f = ^rplM+i — jj^M+u where Ja/+i denotes an (M + 1) x (M + 1) matrix of ones. 
Notice that the spectrum of consists of M copies of and one value of 0; since this 
is a zero-padded version of the spectrum of the M x M frame operator ty^*, we conclude 
that tyty* = ^rpljvf, meaning \l/ is a tight frame. In fact, since the off-diagonal entries of 
\J/*iJ/ are all equal in size (to the Welch bound), ^ is an equiangular tight frame. 

The simplex plays an important role in finite frame theory. Indeed, the Mercedes-Benz 
frame and the vertices of the tetrahedron, being 2- and 3-dimensional realizations of the 
simplex, serve as fundamental examples of frames [7J [23]. Simplices can also be easily 
expressed in higher dimensions by removing the row of l's from an (M + 1) x (M + 1) 
discrete Fourier transform matrix or Hadamard matrix and then normalizing the resulting 
columns. This representation of simplices plays a key role in the construction of Steiner 
ETFs [J5]. In this paper, we are specifically interested in the symmetries of the simplex. 
In general, the symmetry group of a frame is the set of all matrices which, when acting on 
frame elements, permute them. The following result gives a particularly nice description of 
the symmetry group of the simplex: 

Lemma 7. The symmetry group of an M x (M + 1) regular simplex \1/ is the set of all 
matrices of the form U = j^ti^ P^j* ; where P is an (M + 1) x (M + 1) permutation matrix. 
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Proof. The symmetry group of \I/ is the set of all matrices U for which there exists a per- 
mutation matrix P such that U^f — \I/P. Note this implies U^f^f* = \|/P\|/* which, since 
tyty* = ^jIIm, further implies U = jj^^P^J*. In other words, for each member U of the 
symmetry group of there is a unique permutation matrix P such that U^f — $?P. Thus, all 
that remains to be shown is that for each permutation matrix P, the matrix U = -^-j-^P^* 
satisfies U^f = VP. To this end, note 

u* = ^p^v = ^w(fi w - iJM + i) = *p - jfetfpw 

It therefore suffices to show that ^PJm+i = 0. To do this, factor Jm+i as an outer product 
of an all-ones vector with itself, a vector which happens to be preserved by permutations: 
VPJ M +i = VPl M+l l* M+1 = V1 M+1 1* M+1 . Then note that * W = 0: 

||*1m+i|| 2 = 1m+1*** 1 M+1 = _ J! 1 M+l 1 *M+l) 1 M+l = °- D 

From Lemma d we can deduce that the symmetry group of an M x (M + 1) simplex \l/ 
is the symmetric group on M + 1 letters, and so we denote it by Sm+i- We are interested 
in the frames formed by applying the (M + 1)! members of Sm+i to unit vectors. We claim 
that such frames are automatically unit norm tight frames. Moreover, motivated by (ITU]) , 
we further seek the distribution dp of the squared-moduli of the inner products of the frame 
elements with each other. 

Here, it is helpful to note that $* := y/M/(M + is a unitary transformation between 
C M and the M-dimensional orthogonal complement l M +i of the (M+l)-dimensional all-ones 
vector; the proof of this fact is straightforward and is not included here. Indeed, writing any 
unit-norm vector / G C M as / = $g where g G l^+i has \\g\\ = 1, we have inner products 
of the form: 

(f, Uf) = (f, jfeVPWf) = (*7, P®*f) = (9, Pg). (12) 

Moreover, as noted above, our group frame will be tight provided that for any i / the 
following vectors span C M : 

{Ux} UeG = {^P^ x } PeSM+1 = {$P$M Pe5M+1 , 

which is equivalent to having that {Py}pes M +i s P ans Ijyf+i f° r anv nonzero y G l M+1 - This in 
turn is equivalent to showing that z = is the only choice of z G i-M+i for which (z, Py) = 
for all permutations P. To do this, fix any indices n\ ^ n 2 , n s ^ n.4 from {1, . . . , M + 1}, and 
consider the zero inner product (z, P\y) that arises from any permutation P\ which takes n 3 
to n\ and n 4 to n 2 . From this, now subtract the zero inner product from a permutation P 2 
which is identical to Pi, except that it takes n 3 to n 2 and n 4 to n+. 

0=(z,P iy )-(z,P 2 y) 

= z[ni]j/[n 3 ] + z[n 2 ]y[n 4 ] - z[ni]j/[n 4 ] - z[n 2 ]y[n 3 ] 

= (z[ni\ - z[n 2 })(y[n 3 ] - y[n 4 }). (13) 

Now, since ^ y G 1m+i we have that y is a nonzero vector whose entries sum to zero, and 
so in particular there exists indices n 3 and n 4 such that y[n 3 ] — y[n 4 ] 7^ 0. As such, ( TT3]) 
implies that -zfrii] = z[n 2 ] for every choice of n\ ^ n 2l namely that the entries of z are all 
equal. Since z G 1m+i, this means z = as claimed. We summarize these facts below: 
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Theorem 8. Let ^ be an M x (M + 1) matrix whose unit columns form a regular simplex 
in C M . Let f = \J Mj [M + l)\l/g, where g is any unit-norm vector g e C AI+1 whose entries 
sum to zero. Then the group frame 

is a unit norm tight frame of (M + 1)! elements for C M . Moreover, each row of the Gram 
matrix of this frame has entries of the form {(f,Uf)}ueG = {(g, Pg)}p&s M +i- Here, P 
ranges over all (M + 1) x (M + 1) permutation matrices. 



We now use these ideas to construct a frame to be used in conjunction with the bound ( jTUl) . 
where dp[k] denotes the fcth largest value of the form |(/, U f)\ 2 = \(g,Pg)\ 2 . In particular, 
our goal is to find a unit norm vector g e for which the average of the K largest values 
of dp[k] is very close to the average of all of its values: 

Moreover, considering the underlying application of NERFs, we prefer not to transmit as 
many as (M + 1)! frame coefficients to convey an M-dimensional signal. For this reason, we 
seek vectors g which are fixed by a large subgroup of permutation matrices, namely, vectors 
with large level sets; this way, we can get away with only using representatives of distinct 
cosets of this large subgroup. In this paper, we only consider vectors of two level sets, say 

g = ( a,a,...,a , b, b,...,b) . (14) 

L times M+l—L times 

Choosing g in this way guarantees that the corresponding group frame only has { ^) 
distinct elements. Moreover, since each of these unique elements appears the same number 
of times, namely L\(M + 1 — L)\ times, the ( M z |" 1 )-element subframe is still tight. 

To estimate the NERF properties of such frames using (1101) . we first need to find explicit 
expressions for a and b. Here, the condition (g, 1m+i) — implies La + (M + 1 — L)b = 0. 
Combining this with the fact that g has unit norm then gives 



' M + l-L _ / L 

(M + 1)L' ; y (M+ 1)(M+ 1 -L)' 1 ' 

where we take a > without loss of generality. Next, note that (g, Pg) is completely 
determined by the number J of indices n for which g[n] = (Pg)[n] = a. This leads to the 
following calculation: 

(/, Uf) = (g, Pg) = Ja 2 + 2(L - J)ab + (M + 1 + J - 2L)b 2 = ^±^^ . 

Moreover, of the ( M L hl ) distinct Z7/'s in this construction, there are (j) ( M l-j ) w hich pro- 
duce the above inner product, since J of the a's in Pg must align with a's in g, while the 
other L — J a's in Pg align with 6's in g. In the special case where g has L = 2 a's, we have 
a total of ( M 2 ^ 1 ) distinct C//'s, and the distribution of inner products is given by 

1 with multiplicity 1, 

4-3 
2(M-1) 
2 

'Af-1 



{(f,Uf)} = { ww^T) with multiplicity 2(M-1), (16) 



2 with multiplicity \{M - 1){M - 2) 
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As verified below, substituting this fact into flTOj) yields the following result: 

Theorem 9. Pick M > 7 and consider the M x ( A/ 2 +1 ) frame F with columns of the form 
^M/(M + l)^Pg, where ^ is an M x (M + 1) regular simplex and the Pg's are distinct 
permutations of g, which is defined by (TH|) and f[T5]) with L = 2. Then F is a (p,C)- 
numerically erasure- robust frame for every p < (n^TW^d+rj ■ 

The above guarantee bears a striking resemblance to Theorem [6l despite the distribution 
dp being significantly different. Again, while this result is not nearly as good as the ones we 
got for ETFs or random frames, it still gives something; for example, removing any 0.48M 
of the ( M 2 1 ^ 1 ) frame vectors will leave a sub matrix of condition number smaller than 10. As 
one would expect, there are similar NERF results for the frames that correspond to larger 
values of L, but we do not report them here. 

Proof of Theorem Since M > 7, the sizes of the inner products in f|T6|) are nonincreasing, 
and so dp is defined accordingly. Also, taking K — (1 — p)N with p < (h^Wm+T) — m+T' 
we claim that K > 2(M — 1) + 1. Indeed, 

/ 1 \ M 2 
K> (1 )N = > 2(M- 1) + 1, 

where the last inequality follows from M > 7 > 2 + y/2. Since K > 2(M — 1) + 1, then 
applying ( ITU]) to (TTO]) yields 

k=l 

= ML( 1 + 2(M - 1)( M - 3 Y + (K - (2M - 1)) (^) 2 - £ 
(M{M + 1) - 2^\ / M(M 2 - 6M+ 1)\ 

Since K = (1 — p)A^ and = ( A/ 2 I_1 ), we can simplify to get 

2 pM(M 2 - 6M + 1) pM 
^ - n _ nVA/f _ Tvi < 



(l-p)(M- l) 2 ~ 1-p 

From here, p < and ( TTTf) together imply 5^ < , which we substitute into 

(ED to conclude that Cond(F,c) < C. □ 

3. Limiting our expectations 

The previous section gave four different constructions of numerically erasure-robust frames. 
The last three constructions were deterministic, and their proofs hinged on how coherent a 
subcollection of frame vectors can be. In this section, we shed some light on the fundamen- 
tal limits of NERFs by again considering the coherence of frame subcollections. We start 
with the following lemma, which says that a matrix with similar columns will have a large 
condition number: 
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Lemma 10. Take an M x N matrix F with unit-norm columns. Then for every unit vector 

(M -l)\\F*x\\ 2 



x e m M 



(Cond(F)) > 



N - \\F*x\ 



Proof. First, we have A max (FF*) = > ||F*a;|| 2 . Next, take {x m }^ =1 to be some 

orthonormal basis with Xi = x. Then X min (FF*) < ||F*x m || 2 for every m, and so averaging 



over m = 2, . . . , M gives 



M N M 



A min (FF*) < \\F*X m f = J^jYl £ \(*mJn)\ 2 . 



m=2 n=l m=2 

I M 



Since each f n has unit norm and {x m }^ =1 is an orthonormal basis with x\ = x, we continue: 

N - \\F*x\ 



N „ T „„„ „ 2 



A^(FF*)< jj^j;E(l- I I 2 ) 



M — 1 



n=l 

Combining this with our lower bound on A max (-FF*) gives the result. □ 

To be explicit, the lower bound in Lemma [TD] is exceedingly large when the columns of 
F each have a large inner product with x. We now use this lemma to prove the following 
statement on the fundamental limits of NERFs: 

Theorem 11. Take a sequence of real M x N M frames {F M }^ =1 , pick C > 1, and take a 
sequence of erasure rates {pm}m=i suc ^ that 

liminfpM > 1 - 2Q(C), Q(t) := -= / e~ u2/2 du. (17) 
M-j-oc ■ s /2n J t 

Then for all sufficiently large M , Fm is not a (pm, C) -numerically erasure-robust frame. 

Proof. For notational simplicity, we write F = Fm, N = Nm and p = pu- Further let S M_1 
denote the unit sphere in M M . For any x G § M-1 , consider the "polar caps" of the sphere 
about ±x, namely the set B(x) := {y E §> M_1 : \(x,y)\ 2 > jr}. For any such bi-cap, we 
may count the number of frame elements that it contains, namely the cardinality of the set 
B(x) n{/ n }^ =1 . Let Xq denote the point on the sphere whose bi-cap contains the most frame 
elements. By the pigeonhole principle, the fraction of frame elements contained in this bi-cap 
is at least the fraction of its surface area to the surface area of the entire sphere: 



B(x ) n {/„} 



N 

n=l 



> N Area(B(x)) 



Area(S 



M-V 



Assuming for the moment that 1 — p < Area(5(x))/Area(§ M x ), we may take /C to be the 
indices of any K = (1 — p)N of the / ra 's in B(xq) fl {f n }n=i- Then 

keK. 
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and so applying Lemma HO] to the M x K matrix gives 

(Cond(F )? > i^Mltll > ( M -^ _ M-l 2 2 

as claimed. Thus, it only remains to show that 

Area(E(a;)) 

1 " P " Area(S^) (18) 

for sufficiently large M. 

To this end, pick M large enough so that < 1 and take 9 £ (0, |) such that cos 2 = 
Then is the union of both polar caps of angular radius 9 centered at ±x. Using 

hyperspherical coordinates, we find that 

Area(£(x)) = 2 Area(§ M ~ 2 ) / an M ~ 2 (pd(p. (19) 

Jo 

Next, we can substitute t = cos<p to get 

/ sin M "Vd^= / sm M - 3 <psm<pd<p = / (l-t 2 )^dt. (20) 
Jo Jo J cose 

Note that the area of S A/_1 is given by replacing 9 with | in (fl9l) and (|20|) . and so 

Area(E(x)) _ Ql-t 2 )^dt 
Area(§^- 1 ) " £(1 dt ' 

Substituting u = t\jM — 3 and recalling that cos 2 6* = S results in new integrals which 
converge as M grows large: 



VM=3 / 2 M 2x M -~ 3 



Area(5(x)) _ ./cVt? V M - 3 2 



du 



Area(§ M - 1 ) rVn=3 



2 v M-3 
2 



1--^-) ' d M 

Specifically, since (1 + converges from below to e x for all x > 0, we can apply the 
Lebesgue dominated convergence theorem to the Gaussian to obtain 

Area(£(x)) Jg° e^ 2 ^ 
Area(§ A/ - 1 ) e- 2 /Mu ^ [ >' 

This implies that as M grows large, our assumption (iT7|) guarantees ( 1T81) . as needed. □ 

As a corollary to Theorem [Til note that if pm — > 1 as M gets large, then the worst-case 
condition number diverges to infinity. Specifically, this establishes that M x N full spark 
frames with M = o(N) cannot be "maximally robust to erasures" in a numerical sense; for 
sufficiently large M, the adversary can delete N — M columns of the frame in a way that 
leaves an arbitrarily ill-conditioned square submatrix. This highlights the value of a theory 
of numerically erasure-robust frames. 
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4. Implications and remaining problems 

Having constructed several numerically erasure-robust frames, and having further proved 
certain fundamental limits, we conclude with a few interesting observations. First, we con- 
sider an implication for maximal ETFs: no M x N (p, C)-NERF can have (1 — p)N zeros 
in a common row, since otherwise the adversary can delete the other pN columns and leave 
a rank-deficient submatrix. Since Theorem H] also applies to maximal ETFs, this implies 
that there is no basis over which half of a maximal ETF's vectors share a common zero 
coordinate. That is, if maximal ETFs exist, then they cannot be too sparse in any basis. 

Due to their computational benefits, frames which have a sparse representation have re- 
cently become a subject of active research [HI [TO]- In this vein, one attractive feature of 
Steiner ETFs is their naturally sparse representation; in fact, the proportion of nonzero en- 
tries in an MxN Steiner ETF is 0(M~ 1 / 2 ) [15J. However, no Steiner ETF can be maximal, 
for they have at most N = 0(M 3 / 2 ). The work presented here reinforces this fact: since 
noMxJV (p, C)-NERF can be very sparse, and since ETFs with N = Q(M 2 ) are NERFs 
by Theorem we see that neither Steiner ETFs — nor any generalization of the Steiner 
construction with similar levels of sparsity — will ever be able to produce ETFs in which 

n = q(m 2 ). 

Recall that MxN full spark frames have the defining property that every subcollection 
of M columns spans; trivially, this implies that every subcollection of size at least M also 
spans. By analogy, it is natural to ask whether a (p, C)-NERF is also a (p', C)-NERF for 
every p' e [0,p). However, it is not clear whether this is the case, since deleting columns does 
not necessarily worsen a frame's conditioning. As an example, the union of an orthonormal 
basis with some unit vector is not as well conditioned as the orthonormal basis which survives 
the deletion of the last vector. While this open question is interesting, it is inconsequential 
in practice: If the adversary deletes less than pN of the frame vectors, we can neglect more 
of them to guarantee a well-conditioned subframe. 

Another remark: Reviewing the results of this paper, we know there exist NERFs with 
p < \ by Theorem HI Meanwhile, Theorem [11] states that for any fixed C, there do not exist 
NERFs with values of p that grow arbitrarily close to 1. Various questions remain: Do there 
exist NERFs with p e [|, 1)? If so, what is the largest p for which (p, C)-NERFs exist? 
Interestingly, this "one-half barrier" appears to be more than a mere artifact of Theorem HI 
To be clear, every matrix F whose entries are ±l's cannot be a NERF with p > |; for any two 
rows of F, the corresponding entries are either equal or opposite, and so the adversary can 
delete the columns corresponding to the less popular relationship and leave a rank-deficient 
matrix. Moreover, random matrix methods [6j [21] apply to matrices of ±1 entries without 
loss of effectiveness, and so breaking the one-half barrier, if it is even possible, will likely 
require other methods. 
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